1. Initial Agent's State

2. Agent's state after 1K steps

3. Agent's state after 5 million steps

4. Agent's state after 400 million steps

5. Final Agent's state (after 600 million steps)